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Abstract. The goal of this note is to show that some convolution type inequalities from 
Harmonic Analysis and Information Theory, such as Young's convolution inequality (with 
sharp constant), Nelson's hypercontractivity of the Hermite semi-group or Shannon's in- 
equality, can be reduced to a simple geometric study of frames of R 2 . We shall derive 
directly entropic inequalities, which were recently proved to be dual to the Brascamp-Lieb 
convolution type inequalities. 



1. Introduction 

The topic of Brascamp-Lieb and convolution type inequalities was recently renewed by 
Carlen, Lieb and Loss [10] who proposed a semi-group or heat flow approach to these inequal- 
ities. (Soon after Bennett, Carbery, Christ and Tao [7j independently gave a semi-group ap- 
proach to multidimensional Brascamp-Lieb inequalities.) Carlen, Lieb and Loss also obtained 
new inequalities on the sphere, and in particular a subadditivity of the entropy inequality. It 
was noted in [5j that this inequality can be proved using geometric properties of the Fisher 
information of the marginal distributions. Pushing forward these investigations, Carlen and 
Cordero-Erausquin [9] derived a similar geometric treatment for general subbaditivity of the 
entropy inequalities on Euclidean space, and proved that these inequalities are dual to the 
Brascamp-Lieb inequalities. The semi-group approach was recently carried out in the unifying 
framework of abstract Markov semi-groups in [3]. 

The abstract geometric argument in R n is particularly simple and for self-consistency, we 
describe it in details in this introduction. 

In the sequel, fi will stand either for the Lebesgue measure on R or for the standard Gaussian 
probability measure 7 on R (with density (27r) _1//2 e~' i ' / 2 ). Consequently, \i n := /x® n will 
stand for the Lebesgue measure or the standard Gaussian measure 7 n on R ra . It is convenient 
to treat these two cases in parallel although it is possible to derive formally one from another. 

Say that / is a probability density with respect to (w.r.t.) fi n if / : R n — ► R + is such 
that J f diJ, n = 1. Given a random vector X S R n with / as probability density w.r.t. \i n 
(a relation written below as X ~ f dfi n ), its entropy w.r.t \i n is defined (whenever it makes 
sense) by 



S„ n (X) := S^if) := [ f log fd» n . 



In the case \i n is the Lebesgue measure we shall use the notation S(X) = S(f) = f /log/. 
All along the paper, it will be implicitly assumed in all statements that we consider only 
densities and random vectors with well defined and finite // n -entropy. 

If / is probability density w.r.t. [x n on R n and a £ R n is a fixed non-zero vector, denote 
by /( a ) the marginal probability density w.r.t. \i on R, i.e. /( ) dp, is the image of / d/x under 
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the map x — > a ■ x. Thus, fr a \ is characterized by the requirement that 
(1) / (j>(x-a)f{x)dix n {x)= f <f>(t)f ia) (t)dm(t) 



for every bounded measurable (p : R — * R. Equivalently, if X ~ / dfj, n , then /( a ) d\x\ is the 
density of X-a, thatisX-a ~ f( a )dfJ>i- Thus, S^(X-a) = 67, (/( a )) = J R /( a )(t)log/ (a )(t)<i/i(t). 
The classical subadditivity of entropy (usually stated with the Lebesgue measure) indicates 
that for an orthonormal basis (u±, . . . , u n ) of M. n and a random vector X, 

n 

(2) ^5 M1 (X- Ui )<5 M „(X). 

i=l 

The relation between subadditivity inequalities and Brascamp-Lieb inequalities is summarized 
in the following proposition. 

Proposition 1 (|9j). For non-zero vectors a\, . . . , a m £ R n , c% . . . , c m > and D £ WL, the 
following assertions are equivalent: 

(1) For every f h . . . , f m : R -»■ R+, / TT^^ " a *) Cl < e D TT / • 

ra 

(2) For every random vector iGl",^ Cj • a^) < ^(X) + D. 



We also have a complete equivalence between the equality cases. This result is easy to 
prove (actually it holds in much more general settings): it formally relies on the fact that the 
the Legendre transform of the entropy functional is the functional V — > log f e v d/j, n , that is 

log J e v dfi n = sup | J fVdfi n - S Mri (/)j and (/) = sup j J fVdfx n - log J e v d/i n ^ , 

and on how this combines with ([T]). The dual of the subadditivity inequality ([2]) is nothing 
else but Fubini's theorem. 

It is possible to consider more general geometric situations than the one of an orthonormal 
basis (J2]). Of particular interest is the case of a decomposition of the identity, as put forward by 
Ball in the context of Brascamp-Lieb inequalities (see e.g. |T|). Given unit vectors u\, . . . , u m 
in the Euclidean space M n and real numbers ci, . . . , c m > 0, we say that they decompose the 
identity if 

771 

(3) S ^c i u i ®u i = Id K n 

7=1 

where U{ <8> itj stands for the orthogonal projection in the direction U{. Note that necessarily 
Cj < 1 and 

771 

(4) ^2a = n. 

7=1 

It is easy to derive sharp subadditivity entropy inequalities using ([3]) because such de- 
compositions combine nicely with the Fisher information, as noted in [3]. The point is that 
the Fisher information has an L? structure which allows for geometric operations such as 
projections (or equivalently conditional expectation). A random vector X £ R n with / as 
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probability density w.r.t. fi n is said to have finite /i„-Fisher information if the following 
quantity is well defined and finite: 

W*):=I^(/):= / l ^d^ n . 

Jm." J 

It follows from the Cauchy-Schwarz inequality (see 0[9]) that for a unit vector u G M n , 

r (X7f. u ) 2 

(5) / w (/(„)) = / w (X • «) < / 1 , J dfi n , 

with equality if and only if X ■ u and X — (X ■ u)u are independent. If we are given a 
decomposition of the identity, then, rewriting ([3]) in the form 

m 

(6) WeR n , °i ( v • u if = M 2 > 

i=l 

we immediately get from ([5]) that for any random vector X with finite Fisher information, 

m 

(7) J>/ M (X- Ul )<^pO- 

i=l 

In order to get an inequality for entropy, we integrate along the suitable semi-group. Let L 
stand for the differential operator Lf = Af (Laplacian) when \x n is the Lebesgue measure, 
and Lf = A/ — x- V/ in the case \i n = j n . Let Pt = e tL be the corresponding heat semi-group 
and Ornstein-Uhlenbeck semi-group, which admit as invariant measure the Lebesgue and the 
Gaussian measure respectively. If X is a random vector with density / with respect to \x n and 
finite ^ n -entropy, then e tL f is a smooth probability density with respect to fi n , with finite 
/U n -Fisher information, and 

^ t S, n (e tL f) = -I, n (e tL f). 

Moreover, L (and thus e tL ) has the property that it preserves the algebra of functions of the 
form f{x) = g(x • u), a property also used in the proof of ([5]). This ensures the following 
crucial property, namely, for every t > 0, 

(8) (e tL f) (u) =e tL (f (u) ) onR, 

where we used the same notation for the one-dimensional and n-dimensional semi-groups. 
The heat and Ornstein-Uhlenbeck semi-groups are the two most important diffusion semi- 
groups sharing property ([8]) , and this explains the particular role played here by the Lebesgue 
and Gaussian measures. 

Now, integration of ([7|) along the semi-group e tL leads to the inequality Y^ILi °i S^,{X-Ui) < 
S fln (X). From the cases of equality in ([5]) we get that equality holds if and only if for each 
i < m, X • u-i and X — (X • U{)ui are independent (a property which is preserved along the 
semi-group). Under mild conditions on the vectors Uj, it is easily seen that this can happen 
only when X is a Gaussian vector (see [9] for details). The previous discussion is summarized 
in the next proposition, established in [9]. 

Proposition 2. Consider a decomposition of the identity ([3]) in W 1 Then, for all random 
vectors X £ M. n , 

m 

(9) ^CiS^X.ujKS^X). 

i=l 
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Furthermore, under the condition that no two of the unit vectors {u{} are linearly dependent, 
and that if any one vector m is removed from {u±, . . . ,u m }, the remaining vectors still span 
W 1 , equality holds in if and only if X is a Gaussian random variable whose covariance is 
a multiple of the identity. 

In view of the duality given by Proposition [H we recover from the previous inequality Ball's 
form of the Brascamp-Lieb inequality: for every fi,...,f m ■ R — > M + , 

„ m m , . v Cl 

(10) / Hfiix-u^d^x) < H / /idMj • 

Moreover (under the same hypothesis on the Uj's as in the previous proposition), equality 
holds if and only if the measures /, dfii are Gaussian with the same covariance: fi(t) dfi(t) = 

we restrict to centered functions). 
Note that using © it is also possible to pass from inequalities for the Lebesgue measure 
to inequalities for the standard Gaussian measure (and vice versa) by the correspondance 

/««— >/i(t)e-" |a/2 . 

The goal of this note is to prove the efficiency of the theoretical aforementioned approach 
in some meaningful situations. More precisely, we aim at understanding convolution and 
information theoretic inequalities, such as the sharp Young convolution inequality, Nelson's 
hypercontractivity of the Hermite semi-group, or Shannon's inequality, as functional forms 
of some particular decompositions of the identity of M. 2 . To do so, we will study, in the next 
section §2, the decompositions of the identity of M 2 by three vectors. We give a complete 
description of the relation between the coefficients q and the vectors U{ in this case. As a 
consequence, we obtain the following general inequality which can be viewed as the functional 
form of Proposition [5] below. 

Theorem 3. Let Pi,P2,P3 > I be such that 

1 1 1 
- + - + - = 2, 

Pi P2 P3 

and 62,63 G]0, 7r[ be defined by 



(cos(0 2 ), sin(0 2 ))= ( V(Pi -1)(P2 

and 



(cos(0 3 ), sin(0 3 ))= " V(Pi ~ 1)(P 




P1P2 (P3 ~ 1) 
P3 



Pi P3 (P2 



P2 

Then, for every random variables 1,7 6l, 

(11) —SJX) + —SJcos(e 2 )X + sin(6 2 )Y) + — SJcos(e 3 )X + sm(e 3 )Y) < S^(X,Y). 

Pl P2 P3 

with equality if and only if X andY are independent identically distributed Gaussian variables. 
Equivalently, for every functions g G L P2 (n) and h £ L P3 ({i), setting p\ = ^rj; 



(12) 



g(cos(6 2 )x + sin(0 2 )y) h(cos(0 3 )x + sin(0 3 )y) dfj,(y) 



L p i(dij,(x)) 
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with equality if and only if f and g are of constant sign with \g(t)\ P2 d/j,(t) = K 2 e A ^ a2 * )S dt 
and \h(t)\P 3 dfi(t) = K 3 er^*-^) 2 'dt for a 1 ,a 2 € M and K 2 ,K 3 ,X> 0. 

We apply this result in section §3 to the determination of the sharp constant in the Young 
convolution inequality. For this, we will work directly with the entropy and exploit the 
following simple but useful invariance of the entropy (in the case of Lebesgue measure) under 
linear transformation: for a random vector X G M. n and an invertible linear operator A on 

R n , 

(13) S(AX)=S(X)-log(\det(A)\). 

In section §4 we derive the Shannon inequality from a limit of decompositions of the identity. 
The hope is to shep in this way new light on the connection between subadditivity of entropy 
and Shannon's inequality. We study similarly Gaussian inequalities in section §5 and derive in 
particular hypercontractity of the Hermite semi-group (and the associated logarithmic Sobolev 
inequality). For simplicity, we consider here only one-dimensional inequalities. In the last 
section §6 we briefly explain how to extend word by word the approach to multi-dimensional 
situations. 



2. Decomposition of the identity of R 2 

As announced, we investigate here decompositions of the identity of R 2 . A decomposition 
of the identity with only two vectors u\ and u 2 holds if and only if these two vectors form 
an orthonormal basis and c\ = c% = 1. In order to get something of interest, consider the 
case of three distinct unit vectors in R 2 , u\, u 2 and U3. Note that itj <S> Uj = {—Uj) (—Ui) so 
here and in the sequel, 'distinct' really means that the directions Ruj are distinct. The first 
question we address is the following: if the directions are given, can we find positive numbers 
ci,C2,C3 such that the decomposition of the identity 



(14) C\ U\ ®U\ + C 2 U 2 ®U 2 + C3U3 <g)U3 = 

holds? The answer is yes provided the vectors are 'well enough' distributed in space. 

Proposition 4. Let Rui, Hu 2 andM.113 be three distinct directions 0/R 2 . There exists three 
positive numbers c\ , c 2 and C3 such that the decomposition of the identity (| 14|) holds if and 
only if the three geometric angles given by the six angular sectors defined by these directions 
are all strictly smaller than ^. The Ci 's are then given by 

< 15 > « = 5mft ro 1U^-s,) = 1 - cot< "- " V> cMih - 6l) 

for (i,j,k) a permutation of (1,2,3) and ui = ( cos(6*j), sin(#j)) fori = 1,2,3. 



Proof. If two of the vectors are orthogonal, say u\-u 2 = 0, and if (jT4j) holds, then = u\ - u 2 = 
+ + C3 (U3 • u\) (us ■ u 2 ) and therefore, if C3 > then U3 ■ u\ = or U3 • u 2 = 0. But this 
implies M3 = ±u 2 or u% = ±ui, which is excluded. A genuine three vector situation cannot 
contain a two vector situation (which is equivalent to an orthonormal basis). 

By the assumption, the projection u\ <g) u\, u 2 (& u 2 and U3 (g> U3 span 1S2, the 3-dimensional 
space of symmetric operators on M?. Therefore the linear operator 

(Cl, C 2 , C 3 ) > C\Ui (g> Ul + C 2 U 2 (g) U 2 + C3 M 3 (g) W 3 
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is an isomorphism from M 3 onto S2 . Write U{ = (cos(0j), sin(#j)J for i = 1,2,3, where 
all coordinate computations are done in the canonical orthonormal basis of M?; note that 
$i - 6j / 0[f]. Using that 

cos(6*j) 2 cos(0j) sin(6*j) 
cos(^) sin(^) sin(6'i) 2 

it is readily checked that the unique solution in M 3 of (|14p is given by 



Ui <g> Ui 



_ cos(e 2 -e 3 ) _ cos(e 3 -ei) _ cos(0i-6> 2 ) 

Ll sm(6» 2 -6»i)sin(e3-6»i) ' ° 2 sm(8 3 -8 2 ) sin(0i-0 2 ) ' 3 sin(0i-0 3 ) sin(0 2 -6» 3 ) ' 

It remains to identify when this gives a solution to our problem, i.e. when ci, 02,03 > 0. 

All the quantities in the previous equation remain unchanged if we replace some 0{ by 
0i + 7T, which is consistent with the fact that we have been working with directions only. And 
of course, they also remain inchange by rotations, i.e. by 0{ — > 0i + a for i = 1, 2, 3 and oel. 
Therefore, up to a relabeling of the directions, we can assume that = 0\ < 62 < #3 < vr. 
Then the c^s are positive if and only if 

e 3 -6 2 < vr/2, 3 - > vr/2, # 2 - < vr/2. 

Rewriting the second condition as ir — O3 < ir/2, we get the announced condition on the three 
angular sectors 62, 03 — 02, tt — #3. □ 

We now investigate the converse procedure. Given three numbers ci, 02,03 G (0,1) such 
that 

(16) ci + c 2 + c 3 = 2, 

we would like to know whether is possible to find directions u\ = (cos(#j), sin(#j)) for which 
the decomposition of the identity (I14p holds. The answer is yes, and the construction is 
unique up an isometry of IR 2 (which clearly preserves decompositions of the identity). 

Proposition 5. For given 01,02,03 G (0,1) satisfying (|16[) . t/iere exists a triple of directions 
Mu\, Ru2 and Mu 3 unique up to isometries such that the decomposition of the identity (|14p 
holds (the solutions Ui = (cos(0j), sin(0j)) are gwen 6y equation (fT8l) below). More explicitly, 
all solutions are obtained by performing an isometry on 
(17) 




Cl C2 V Cl C 2 / \ V c l c 3 V Cl C3 

Proof. Inverting formally (|15p we get 



oot(fl a -g 3 )=eV <1 ~?lgr g8) , cot(g 3 -6>i)=e ) / (1 "?-Sr Cl) 



(1-Ci)(l-C2) 



(18) cot(e 1 -e 2 )=eJ 

and e = ±1. This uniquely determines the directions Mut up to isometries. To check this, 
first perform a rotation ensuring that 0\ = and # 2 ,# 3 G (0,7r). We still have an invariance 
by symmetry with respect to the coordinate axis x = 0, which corresponds to the sign of e. 
Thus, without loss of generality, we can impose 

(19) = 9i < 02 < 03 < 7T. 
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The last two equalities in (fT8|) give that cot(# 2 ) and cot(#3) are of opposite sign, and thus, 
by (fl9j) . e = —1, 6> 2 G (0,7r/2) and #3 G (tt/2,tt), these angles being uniquely determined by 



(20) cot^) = v (l "Ti ( c 3 " c2) and cot(^)=-v (i ' c i K c i r i) - 

Note that the first equality of (|18|) ensures that #3 — # 2 G (0,7r/2), which is consistent with 
the condition in the previous theorem. So long for unicity. It remains now to check that we 
indeed get a solution. Equivalently, back to the situation (fT9]) - ([20]) . we need to check that 
once the last two equalities from (fTBj) are used to uniquely determine the angles 62 and #3, 
the first equality of (|18p is then automatically verified. This is indeed the case since 



(l-C2)(l-c 3 ) X COT ^ 3 - y (l-caXl-cs) X cot(O 2 )-cot(0 3 ) _ (l-c 2 )+(l-c 3 ) • 

Therefore the compatibility condition (I16p yields the desired equation. Finally, the solu- 
tion (fl9l) - (j20"1) rewrites as (fTTl) in coordinates and the proof is complete. □ 

We can now derive the main Theorem stated in the introduction. 

Proof of Theorem^ Introducing, for i = 1, 2, 3, q = — , the unit vectors U{ = (cos(0j), sin(#j)) 
of the previous proposition can be rewritten as u\ = (1, 0), 



U 2 - 



(VWl - 1)(P2 - 1), V " 1 " 2 ^'^ ) > « 3 =(-V(Pl-l)(P3-l), \/ 



Pi P3 (P2~l) 



P2 



Then the result for entropy follows from Proposition [21 Next note that it is enough to 
prove (|12p in the case of nonnegative functions, and therefore the integral inequality to be 
proven is 

J J f(x) g(cos(9 2 )x+sm(e 2 )y) h(cos(e 3 )x+sm(e 3 )y) dfi(x)dfi(y) < ||/||lpi Ibllx^^) \\h\\ L P 3 (fJ>) 

for f,g,h : R — > R + . But this inequality holds as dual of inequality (lllh by virtue of 
Proposition [TJ The cases of equality follow from the general considerations given in the 
introduction. □ 



3. Sharp Young's convolution inequality 

Here we work with the Lebesgue measure. Let p, q, r > 1 be such that 

111 
21 _ + _ = ! + _, 

p q r 

which can be rewritten as 

1 1 1 

- + - + - = 2. 

r p q 

Apply then Theorem [3] with p\ = r' , P2 = P , P3 = q- The angles 02,63, or equivalently the 
unit vectors Uj = (cos(0i),sin(0j)), i = 1,2,3, given by Theorem [3] are 



(22) n 1 = (l,0), u 2 =[ V(r'-l)(p-l), , u 3 = ( - V(r' - l)(q - 1), 



Therefore, for any random vector (W, iT) G R (with finite entropy), 

(23) ^5(W) + -S(cos(0 2 )W r + sin(0 2 )Z) + - 5(cos(6» 3 )W + sin(0 3 )^) <S(W,#). 



A 
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We would like to have as random variables in the left-hand side multiples of W, W — Z and Z, 
respectively. To this task, first perform a linear transformation leaving W invariant so that 
the last variable is a multiple of Z, and then a diagonal linear operator so that the second 
one is a multiple of W — Z. Readily, perform the linear transformation (X, Y) = A(W, Z) 
with 

cot(0 3 ) 1 

COt (0 3 ) - COt (0 2 ) 

Then, using (fl~3j) . it follows that (f23|) is equivalent to the following subadditivity inequality: 
for every random variables X and Y, 

^ S ( ,(^ Y ) + " S (M02)(X -Y))+- S{ sin(0 3 ) Y) 

r cot (03 ) — cot (02 J p Q 

< S(X, Y) + log | cot(0 3 ) - cot(0 2 ) | . 
This inequality is also equivalent, using again the scaling of entropy (in dimension 1) to 

(24) 1 S(X) + l ~S(X- Y)) + 1 S(Y) < S(X, Y) + D 

with 

D = fl-^- N )log|cot(03)-cot(02)| + -logsin(02) + -logsin(0 3 ). 
\ r / P Q 

Using that ^ + jr = p, it follows that cot(0 2 ) - cot(0 3 ) = jr\f^f and 

D = log y/r + — log \fr' + - log Jp log + - log Jq log \ftf. 

r r p p' q q 



For t > 1, set C t := J -p^p- where as before t' is the conjugate of t. We have thus derived the 
following classical result. 

Theorem 6 (Sharp Young's convolution inequality). Letp,q,r > 1 satisfy (f2Tj) . For every 
random variables X,Y £M, 



i S(X) + I S(X - Y) + I S(Y) < S(X, Y) + log ( 



~c7 



Furthermore, the inequality is sharp: equality holds if and only if (X, Y) G M 2 is a Gaussian 
vector whose covariance matrix is a multiple of A* A. 
Equivalently, for every f G LP(U.) and g G L q (R), 

C C 

( 25 ) 11/ * s'IIl'-(r) < v n II/IIlp(m) IIsIIl^r). 

For the equality cases in the entropic inequality, note that in view of Proposition [2] equality 
holds in f)23|) if and only if (W, Z) is a Gaussian vector with covariance a multiple of Idjj2 , and 
(X, Y) = A(W, Z). Next, note that Young's inequality ([25]) reduces to the case of nonnegative 
functions, which is then equivalent to following dual form of the entropic inequality: 

ff C C 

(26) // f[x)g(x -y)h(y)dxdy < H/H^r) ||5||w(m) II#IIl?(k) 

for every nonnegative functions f,g,h:M.—> M + . It is possible to deduce the equality cases in 
this inequality (and therefore in Young's convolution inequality) from the ones in the entropic 
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inequality, as described in [9] (we get some well chosen Gaussian functions) . Actually, it is also 
possible to use that inequality ([26]) is obtained by rescaling the functions (after the change 
of variables (x,y) = A(w,z)) in the Brascamp-Lieb inequality dual to ([23]) . where equality 
holds if and only the functions are Gaussian with the same covariance. 

The sharp Young convolution inequality ([25]) was obtained independently by Beckner [5] 
and Brascamp and Lieb [6] . Their proofs rely on rearrangements of functions and tensorization 
arguments. Barthe [2 J gave a new (simpler) proof using a mass transportation argument. One 
of the advantage of the geometric entropic approach used here is that it makes it possible to 
extend it to other contexts. 



4. Shannon's inequality 

We continue to work with the Lebesgue measure. We now aim at reproducing the following 
classical result in Information Theory (see [11] for details). 

Theorem 7 (Shannon's inequality). Let X and Y be two independent random variables. 
Then 

X + Y \ S(X) + S(Y) 
V2 J ~ 2 

It is well known that in the case Y (say) is symmetric Y ~ —Y, then Shannon's inequality 
follows from the classical subadditivity of the entropy ([2]) since 

25 (*jf) - s fir) + s &f) £ s< *- Y > = sm + siY) 

where the last equality expresses the independence of X and Y. However this situation is 
misleading since in the general case Shannon's inequality seems to be different in nature than 
an inequality of subadditivity of entropy. One of the obstacle is that we would like to use a 
decomposition of the identity with the basis vectors e\ and e 2 together with ei ^ 2 . But this 
is not possible. We shall instead approximate such a situation. 

Recall that if G stands for a standard Gaussian variable independent of all the variables 
considered here, then if X has finite entropy, S(X + e G) — > S(X) when e — * 0. Therefore we 
can restrict our study to the case where X and Y have smooth densities (with sub-gaussian 
tails). For such regular variables, it is well known that when e — > 0, 

(28) S(X + eY) = S(X) + 0{e 2 ). 

Using the notation u(9) = (cos(0), sin(0)), introduce for fixed s S (—^,0) (s will later tend 
to 0~) the unit vectors 

m = u(s), u 2 = u(j), u 3 = u(f-s). 

These vectors define three directions satisfying the assumption of Proposition^ Let c\ , C2 , C3 G 
(0, 1) be the associated coefficients for which there is a decomposition of the identity so that, 
by Proposition [21 

(29) ci S{ cos{s)X + sin(s)Y) + c 2 + c 3 S( sm(s)X + cos(s)Y) < S(X, Y) 
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for every random variables X and Y with finite entropy. By Proposition [H as s — > , 

cos(s) , , cos(£ — 2s) 

ci = -t^ rT = l + 2 S + o S ) c 2 = \ ' ( 7r \ = -4fi + o (a ), 

sm(| — s) sin(-| — 2s) sm(-| — s) sm(s — j) 

cos(f — s) 
C3 = ■ ,„ r p > = 1 + 2s + os. 

sm(2s — ^) sm(s) 

Note also that when X and Y are (regular enough) random variables and s — > 0~ we have, 
in view of <\28h . that 

5(cos(s)X + sin(s)y) = s(x + -^4y) -logcos(s) = S(X) + o(s) 

\ cos(s) / 

and similarly 5(sin(s)X + cos(s)y) = S(Y) + o(s). Therefore, making a Taylor expansion 
in (|29p when X and y are independent and s — > 0~, it follows that 

S'(X) + S(Y) -s[aS(^) - 2S(X) - 25(y)] + o(s) < 5(X,y) = S(X) + 5(y). 

The first order in s < gives the desired Shannon inequality. 

Remark 8. In view of the duality between entropy and Brascamp-Lieb inequalities, one could 
wonder if the Shannon inequality admits a dual form. However, if we start with the Brascamp- 
Lieb inequality, in the same situation as above, and perform the Taylor expansion there, then 
we end up again with the Shannon inequality. We are in a situation where the entropic and 
Brascamp-Lieb inequalities coincide at the first order. 

It should be noted that one can prove along the same lines the Blachmann-Stam inequality 
(cf.HU): 

(so) /x + yw(* ) + i(y) 



V2 J ~ 2 

for independent random variables with finite Fisher information. Indeed, the decomposition 
of the identity obtained above and Proposition [2] and ([7]) give the result once it has been 
noted that for regular enough random variables, as for entropy, I(X + e Y) = I(X) + 0(e 2 ) as 
e — > 0. Moreover, for independent random variables I(X, Y) = I{X) + I(Y). Note that by the 
scaling of information, inequality (|30|) is commonly rewritten as / (X + Y) < I(X) + I(Y). 

Finally, we would like to mention that we could as well have started from Theorem [3l A 
first order Taylor expansion when p2 = P3 — » 2 (and therefore p\ — > 1) in (jlip gives again the 
Shannon inequality. Having derived the sharp Young inequality from (jlip . this procedure is 
reminiscent of Dembo's proof of Shannon's inequality which consisted in a Taylor expansion 
in (j25j) around p = q = 2 (see [TTJ ) . 

5. HYPERCONTRACTIVITY AND LOGARITHMIC SOBOLEV INEQUALITY 

In this section, the measure fj, = 7 will be the Gaussian measure and fj, n = j n . 
Assume we are given p, q with 

(31) 1< p < q 

and set 9 E [0, |) such that 



(32) cos(0) ' 
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Write as before t' = t/(t — 1) for t > 1. Since q > p, let r G [1, q) be such that | = i + or 
equivalently 

Then, introducing £ £ (f ) 7r ) such that 



1 1 1 

- + - + - = 2. 

q p r 



(33) cos(e) = _yi_l, 

the angles are exactly the ones associated to the triple 

Pi = q' , P2 = P , P3 = r 
in Theorem [3] (6\ = 9, 62 = £)• Consequently, for every random variables X, Y, 

(34) jS y (X) + -S , 7 (cos(0)X + sin(0)y) + ~S 7 ( cos(£)X + sin(£)K) < S 72 (X,Y). 



We emphasize in the next proposition the corresponding convolution inequality (|12|) of inde- 
pendent interest (as a stronger statement than the classical hypercontractivity) . 

Proposition 9 (Hypercontractivity). Letp,r>\. 7/1 + 1 = 1 + 1 and (|52 ]1 -Ifgg j) /10/cZ, i/ieri 
/or every functions f £ L p {^) and g £ L r {^), 



/(cos(0)x + sin(0)y) g{ cos(£)x + sin(£)y) cfy(y) 



L9(d7(a;)) 



iz;ii/j equality if and only if f,g are of constant sign with \f(x)\ p d'j(x) = K\ e x \ x ai ^dx and 
\g{x)\ T d~i{x) =K 2 e- x \ x - a2 \ 2 dx, K U K 2 ,\> 0, 01,02 € M. 

Indeed, this result contains the hypercontractivity inequality for the Hermite semi-group 
P e (f)(x) := / /(cos(0)x + cos(%)d 7 (y). 



(One may work as well with the Ornstein-Uhlenbeck semi-group Ptf := e tL f = -P a rccos(e-')/)- 
Proposition [9] applied with g = 1 namely indicates that under (f3"Tj) - (f32j) . 

(35) V/e^), \\Pef\\ L «(y)<\\fhp( 7 )- 

Equality holds iff / is exponential, f(x) = Ke~ ax (A = 1/2 since g = 1 in the Proposition). 

If we rather work at the level entropic inequalities, first note that for every random variable 
Z, S^(Z) > 0, with equality if Z is a standard Gaussian variable. Therefore, inequality ([34 
implies that, for every random variables X, Y, 

(36) -^SJX) + -S 7 ((cos(0)X + sin(0)Y) < S y JX,Y), 
q' p ' 



which is dual to the following Brascamp-Lieb inequality, equivalent to (|35p : for every functions 
g,f :K^R+, 

5(2;) /(cos(0)a; + sm(0)y) dj{x)d-y(y) < \\f\\ L p( 7 ) ||ff[lx 9 '( 7 )- 
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Remark 10 (Particularity of the Gaussian case). It is worth noting that in the Gaussian 
case, inequalities Q and fjlOf) hold under the weaker condition 

m 

(37) a m (g> m < IdRn . 

i=l 

(So here we can simply use that hex®e\-\-^ ) u{9)®u{9) < Id^a, instead of 'forgetting' terms, as 
we did above.) The reason is the following (this was also noted in the spherical case in [3] and it 
is in fact a general feature when working on a probability space). From the explanations given 
in the introduction, it is clear that (|37j) is always sufficient to get (j2J) . When integrating along 
the Heat semi-group, it is necessary to rescale in order to obtain asymptotically a standard 
Gaussian. So the condition is there crucial (see (SJj. But in the Gaussian case, there is 
no need to rescale when integrating along the Ornstein-Uhlenbeck semi-group, and so we can 
indeed derive inequalities © and (fTOl) from ((371) . Alternatively, starting from ((371) . we can 
complete the self-adjoint operator on the RHS in order to get a decomposition of the identity, 
by adding some unit vectors u'j and coefficients c'j . We then simply apply (jSJ) and ((ID]) in the 
case of this decomposition, but with Xj = G (standard Gaussian independent of the rest) and 
fj = 1, respectively, for the added indices. 

It is well known that the Gaussian logarithmic Sobolev inequality is equivalent to the 
hypercontractivity inequality f)35[) [12] . To derive the logarithmic Sobolev inequality, one can 
differentiate (|35|) at 9 = 0. Let us explain how it is even easier to see this implication when 
working with the dual entropic form. Recall that for every random variable Z, Sj(Z) > 
with equality if Z is a standard Gaussian variable. Let X be a random variable and G be a 
standard Gaussian variable independent of X. For 6 £ [0, ^), set 

PgX := cos(9)X + sm(9)G. 

Then, inequality (f36j) gives that, under (f3T|) - ([52]) . 

^jSJX) + -S^(PeX) < S r2 (X, G) = SJX) 
q' p 

p 

which rewrites as S^(PqX) < -Sj(X). Therefore, for any 9 £ [0, |) and q > 1, by picking the 
appropriate p verifying ([36]) . S(PgX) < jlfgg ^ S(X). Letting q — > +oo, 

S^PeX) < cos 2 (#)S 7 pO 

which is the well known integrated form of the logarithmic Sobolev inequality. Indeed, since 
there is equality 9 = 0, the 9 2 order term gives the logarithmic Sobolev inequality 

S 1 [X)<\l 1 iX) = -^S 1 {PeX). 

6. Higher dimensional inequalities 

We have studied convolution inequalities for functions on M. and for random variables. But 
the strategy applies word by word to convolution inequalities for functions on R n and random 
vectors. Let us briefly explain why. 

All subspaces of M. n are equipped with the Euclidean structure inherited from the standard 
Euclidean structure on W 1 . Accordingly, for a subspace E C W 1 , the measure he will stand for 
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the Lebesgue measure or the standard Gaussian measure on E. Denote by Pg the orthogonal 
projection in M. n onto E, and if / is a probability density w.r.t. fj, n , denote by f(E) the 
probability density w.r.t. [ie which is the image of fdfi under the map Pg- For every random 
vector I £ I", we have that X ~ f dfi PeX ~ f( E ) d/J-E and 



g{y)f{E){y)dnE{y) = I g{PEx)f{x)dii n (x). 

IE JR n 
The following analogue of (|5|) immediately holds: for / a (smooth) probability density w.r.t. 
/U n (or any random vector with X ~ fdfi n ), 

I^V/| 2 
-j d/i n . 

Assume we are given a collection of subspaces Ex,...E m C R n and of positive numbers 
ci , . . . , Cm > such that 



(38) I, E (f(E)) = I^(PeX) < [ 



(39) J2ciP Ei =Id Rn . 

Then, by ®-fl32J, ET= i c i I he {^EiX) < I^ n (X). After integration along appropriate semi- 
groups Pt (noting again that {Ptf)(E t ) = Pt(fEi)), we get the analogue of Proposition [2j 

m 

E^ MSj (P £i X)<^(I), 

i=l 

and by duality, for /j : E^ — > R + , i = l,...,m, the classical (multidimensional) geometric 
Brascamp-Lieb inequality 

Ci 



~ in rn / p 

/ \[r{P El x)d^ n <\{( / fid m 



The convolution inequalities on R n are obtained by using appropriate projections onto 
three n-dimensional subspaces of R 2n . Given an angle 9 S [0,7r], denote by Pg the projection 
in R 2n obtained by tensorizing the projection on the direction u{6) in R 2 : 

p d = UgU e , with Ug = (cos(0)Id Rn sin(0)Id R «) : R 2n R n . 

Identifying R 2n with R n x R n , Pg(x,y) = Ug(cos(9)x + sin(%) for x,y £ R n . The projection 
onto the first R n is Po : Po{x,y) = x. Note that the image subspace Eg := Im(Pg) is normally 
parametrized by R n as Eg = {Uq*z; z £ R n }. Therefore, for a random vector (X, Y) 6 R 2n , 

S^ Eg [Pe(X,Y)) = S Mn (cos(0)X + sin(tf)Y). 

Assume then we are given pi,P2,P3 > 1 with — + i + i = 2, and let #2,^3 be the 
angles given by Theorem [3l These angles came from the decomposition of the identity of 
Proposition [5l which extends to a decomposition of the identity of R 2n : 

—P + — Pg 2 + —Pg 3 = Id R 2n. 
Pi P2 P3 

By the previous considerations, Theorem [3] immediately extends to random vectors on 1" 
and functions on R n : for every random vectors X, Y £ R n , 

^J- V (X) + 1 5 Mn ( cos(# 2 )X + sin(0 2 )Y) + 1 5 Mn ( cos(# 3 )X + S m(6 3 )Y) < S^ n (X, Y) . 
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Similarly, for every functions /, g : R n — ► R with g G L P2 (/u, n ) and /i G L P3 (/j, n ), 

g(cos(9 2 )x + sm(d 2 )y) h(cos(6 3 )x + sin(0 3 )y) ; < HffH^^) H^IU^^)- 

L p i{dix n {x)) 

The cases of equality are also the same. Then, the multidimentional forms of Young's convolu- 
tion inequality, Shannon's inequality, Hypercontractivity and logarithmic Sobolev inequality 
are obtained by the same computations we have performed previously. 

Of course, it is also possible to derive these inequalities from the one-dimensional ones by 
standard tensorization techniques. But as pointed out earlier, the geometric approach used 
here might prove useful in some other contexts. 
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